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We present several application of simple topological arguments in problems of Kolmogorov com- 
plexity. Basically we use the standard fact from topology that the disk is simply connected. It proves 
to be enough to construct strings with some nontrivial algorithmic properties. 

1 Introduction 

In this paper we show that a very simple and intuitive topological technique can be surprisingly effective 
in algorithmic information theory. We present several examples of "topological" proofs of existence of 
strings with some nontrivial algorithmic properties. We focus on technical aspects of the arguments, 
assuming that the reader is familiar with basic notions of algorithmic information theory such as Kol- 
mogorov complexity and the mutual information (see the classic textbook ifllO. 

Let us start with an example. Consider a string x that has complexity n (we consider plain complexity 
C(x), but this does not matter for now). We want to find a string y such that C(x\y) « n/2. This can be 
done as follows: consider the shortest description p of x; it has length n; let y be the first half of this 
description. Then it is easy to check that C(jc|)>) =n/2+ O(logn). 

However, if we want C(x\y) to be close to n/2 with (maximal possible) precision 0(1), one needs 
a different argument. Let us start with y = x (when C(x\y) ~ 0) and delete bits (say, at the end) one 
by one until we get y = A (the empty string) when C(x\y) ~ n). When the last bit of y is deleted, the 
conditional complexity C(x\y) changes by at most 0(1), so it cannot cross the threshold n/2 without 
visiting 0(l)-neighborhood of n/2. 

Topological arguments of this type can be used in two (and more) dimensions, though they become 
a bit less trivial. We provide several examples of this type in the following sections. 

2 Vyugin's result and its extensions 

M. Vyugin O has shown that for every n and for every string x with C(x) > 2n + 0(1) one can find a 
string y such that both conditional complexities C(x\y) and C(y\x) are equal to n+ 0(1). This is proved 
with a (rather ingenious) game argument. As we shall see in this paper, the condition C(x) > 2n is 
stronger than necessary; it is enough to assume that C(x) >n + clogn for some c. This can be shown 
(unless C(x) is not very large) by a simple topological argument. (The game argument still seems to be 
necessary if C(x) is really big compared to n.) 

Similar reasoning allows us to construct y such that both complexities C(x\y) and C(y\x) has prescibed 
values with 0( Imprecision, even if those values are different. (This question was discussed in Vyu- 
gin's paper 13, but no positive result of this type is given there except for the already mentioned case 
m = n + 0{\).) Again we need some restrictions that guarantee that prescribed values are not unreason- 
able large or small. Here is the exact statement. 
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Theorem 1. Let P be some polynomial. There exists a constant c such that for every string x and for all 
integers m,n such that 

• n + clogn < C(x) <P(n); 

• clogn < m < P(n), 

there exists a string y such that \ C{x\y) — n\ < c and \ C{y\x) —m\ <c. 

(Since C(^;|y) does not exceed C(x), we need C(x) to be greater than n; the "safety margin" of size 
C?(log?i) is assumed in the statement. We also require m and C(x) to be polynomially bounded.) 

Proof. Let p be the shortest description of x; it is a string of length C(x) . Consider also an incompressible 
string z of length sligthly greater than m, e.g., let |z| and C(z) be equal to 2m + 0(1). Moreover, we 
take z independent from p, so C(z\p) is also 2m + 0(1). (Random z has these properties with positive 
probability; in fact, some other properties of z will be needed, see below.) 

The string y is then constructed as (the encoding of) a pair (p' ,z') where p' and z' are prefixes of p 
and z (respectively); it remains to decide how long p' and z' should be. In other words, we have two 
parameters, \p'\ and \z'\, and the space of the parameters is a rectangle. Each point in this rectangle 
determines y = (p',z r ) and is mapped to the point (C(x\y), C(y\x)) at the right. We need to show that 
some point is mapped into (n,m) with 0(1) -precision. 

\z'\ C(y\x) 





C(x\y) 



Figure 1: Each pair in the left rectangle determines y = (p',z f ) and is mapped into a pair C(x\y), C(y\x) 
in the right one. 

To show this, we note that the mapping is "continuous" in the sense that neighbor points on the left 
are mapped into points at distance 0(1) on the righ{|] Indeed, C(u\v) changes only by 0(1) if u or v 
is changed by adding or deleting the last bit. Consider a path A-B-C-D-A that goes counterclockwise 
around the rectangle on the left; as we shall see, the image path on the right will go clockwise (with 
logarithmic precision) around the rectangle and makes one turn around the point (m,n). Then we can 
continuously transform the path on the left into one point (since rectangle is simply connected); if its 
image on the right never comes close to (m,n), we get a contradiction. 

Now we look closely at the path around the rectangle and its image. Note that our assumptions 
guarantee that logC(x) = &(\ogn), so we write just O(log) having in mind 0(log«) or 0(logC(x)); note 
also that O(logm) < O(log). 



1 Technically, Lipschitz property is a better name here. However, we will use discrete versions of topological results about 
continuous mappings, so we keep the name. 
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• Point A: here y = (A, A), so C(y\x) = 0(1) and C(x\y) = C(x) + 0(1), so the image is A' = 
(C(x),0) with 0( Imprecision. 

• Edge A-B: here y = (p',A). Then C(y\x) = 0(\og) since C(/?|jc) = 0(logC(x)) [the conditional 
complexity of a shortest description of x given x is 0(logC(x))], and the length of p' can be 
described by 0(log C{x)) bits, too. And C(x|y) is somewhere between and C(x) + 0(1). 

• Point B: here y = (p,A), so C{y\x) = 0(logC(x)) and C{x\y) = 0(1); the image is B' = (0,0) 
with 0(log)-precision. So the edge A-B is mapped into a path along A'B' going from A' to B' with 
0(log)-precision. 

• Edge B-C: here y = (p,z r ), so C(y|x) is somewhere between and 2m + O(log) (recall that the 
length of z' is between and 2m and requires O(logm) = O(log) bits; C(p\x) is also O(log)). On 
the other hand, C(x\y) = 0(1), since p determines x. 

• Point C: here y = (p,z), so C(x\y) = 0(1) and C(y\x) = O(log) + C{z\x) = 2m + 0(log). So the 
image is C' = (0,2m) with 0(log)-precision. 

• Edge C-D: here y = (p',z), so C(y|jc) = 2m + O(log) and C(x|y) is between and C(x) + O(log). 

• Point D: here y = (A,z), so C(y\x) = C(z\x) + 0{1) = 2m + 0(1) and C(x|y) = C(x|z) = C{x) + 
O(log) since x and z have only logarithmic mutual information. So the image of D is D' = 
(C(x),2m) with 0(log)-precision. 

• Edge D-A: here y = (A,z ; ), so C(y\x) is between and 2m + O(log), and C(x\y) is O(log) (note 
that z' can have only O(log) bit of additional information compared to z). 

This analysis shows that the path on the right follows the trajectory A'-B'-C'-D'-A' with O(log)- 
precision and therefore turns around the point (m,n) if this point is 0(log)-far from the boundary of the 
rectangle, and this is exactly what our assumption guarantees. □ 

3 Decreasing complexity by using an oracle 

Let a and b be two strings. They have some complexities C(a) and C(b). If a third string t is given, we 
can consider the conditional complexities C(a\y) and C{b\y) which are (in general) smaller that C(a) and 
C(b). Now the question: can we describe the pairs (C(a\y),C(b\y)) that can be obtained by choosing an 
appropriate value of yl We answer this question for the case when a and b have small mutual information, 
and the answer is simple: we can get an arbitrary pair (a,j3) such that < a < C(a) and < j8 < C(b) 
and a,j6 are not too close to the endpoints of the corresponding intervals (the distance is big compared 
to the logarithms of complexities and to the mutual information). 

Theorem 2. For some constant c the following statement holds: for every two strings a,b and for every 
integers a,j8 such that 

• a,p >c(logC(a) + logC(b)+I(a:b)); 

• a < C{a) - c(log C(a) + log C(b) + 1 (a :b)); 

• J8 <C(b)-c{logC(a)+\ogC(b)+I(a:b)), 

there exists a string y such that \ C(a\y) — a\ < c and \ C(b\y) — j8 1 < c. 

Note that this statement is evidently true if instead of 0( Imprecision we are satisfied with a precision 
of 0(log C(a) + log C(b) +I(a :b)). Indeed, we can consider the shortest descriptions p and q for a and 
b and then let y = (p',q') where p' is p without a last bits, q' is q without j3 last bits; the information 
distance between a,p and between b,q is logarithmic, p' and q' are independent with our precision, etc. 
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Proof. To get 0(1) -precision, we need to combine the construction above with topological arguments 
similar to the proof of Theorem[T] Consider the shortest descriptions p and q for a and b. Then \p\ = C(a) 
and \q\ = C{b). For every pair (u,v) of integers such that < u < \p\ and < v < \q\ consider 

y(u,v) = (p without u last bits, q without v last bits). 

As we have discussed, C(a\y(u,v)) and C(b\y(u,v)) are close to u and v respectively; the distance is 
0(logC{a)+logC(b)+I(a:b)). 

In other terms, let us consider the mapping 

(w,v) H- (C(a\y(u,v)),C{b\y(u,v)); 

it is defined on the rectangle [0,C(a)] x [0,C(j)] and is close to the identity mapping, the distance is 
0(log C(a) +log C(b) +I(a : b)). This mapping is also continuous in the sense explained above. Now the 
topological argument can be used to show that the image 0(l)-covers the 0(logC(a) -\-logC(b) +I{a : 
b)) -interior of the rectangle. □ 

Remark. This argument can be generalized easily to three (or more) dimensions. For example, let 
us consider three strings a,b,c that are almost independent. In this case we get a mapping of a three- 
dimensional box to itself which is "continuous" and is close to identity. Then a topological argument 
(based on the fact that identity mapping of the two-dimensional sphere S 2 is not homotopic to the constant 
mapping) shows that the image of this mapping cover (with 0( Imprecision) the interior of the box. 



4 Combination with Muchnik's technique 

For the case when a and b are dependent, the result of Theorem [2] looks rather weak. We can extent the 
area of pairs (a, j8) that can be covered, if we combine the topological technique with an argument based 
on Muchnik's theorem on conditional description. 

Theorem 3. For some constant c the following statement holds: for every two strings a,b and for every 
integers a,f5 such that 

• a < C(a) — clogn, 

• /3 < C(b)- clogn, 

• -C(a\b) +clogrc < j3 - a < C(b\a) - clogn, 

(where n = C(a) + C(b)) there exists a string y such that \ C(a\y) — cc\ < c and \ C(b\y) — jS| < c. 

Proof. The proof is based on the following lemma. 

Lemma 1. For all strings a and b there exist strings a' and b' such that 

• C(a') = \a'\ = C(a) and C(b') = \b'\ = C(b), 

• C(a'\a) = 0(logn) and C{b'\b) = 0{logn), 

• l(a\ : b' m ) = max{0,/ + m — C(a,b)} + Oilogn) for each I = 1, . . . , \a'\ and m= 1, . . . , \b'\, 
where n = C(a) + C(b), and a\, b' m denote prefixes of string a' and b' of length I and m respectively. 

Proof of lemma: We use the method of conditional descriptions proposed by Andrei Muchnik. First, we 
apply Theorem 2 from [3] and get a conditional description of a given b and given the empty string. 
Muchnik's theorem guarantees that there exists a string a' such that 
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• \a'\ = C(a), 

• C(a'\a) = O(logn), 

• C(a\a') = 0(logn), 

• C(a|a",ft) = 0(log7i), where a" is the prefix of a' of length C(a\b). 

Then, we apply the same theorem of Muchnik again: we construct a description of ft conditional on 
\a'\ + 1 strings a' ,a\,..., a'<,i . We get b' such that 

• \b'\ = C(b), 

• C(b'\b) = O(logn), 

• C^ftla'^ft^) = 0(log?i) for all / < \a'\ and m < \b'\ such that l + m = C(a,b). 

The constructed a' and ft' satisfy the theorem. □ 

To prove the theorem we need to combine the argument from Theorem [5] and Lemma [T] Let a' and 
b' be the strings from Lemma. For every pair (l,m) of integers such that < / < \a'\ and < m < \b'\ we 
define 

y(l,m) = (first / bits of a, first m bits of ft'). 
Similarly to the proof of Theorem [2] we consider the mapping 

(l,m)^(C(a\y(l,m)),C(b\y(l,m)), 

which is defined on the rectangle [0, C(a)] x [0, C(ft)]. 

m C{b\y) 




C{a\y) 



Figure 2: Each pair of integers on the left determines y = (a'^b^), which is mapped to a pair 
(C(a\y),C(b\y)) on the right. 

This mapping is "continuous" (neighbor points mapped into points at distance 0(1)). Consider the 
path U-V-W-X1-X2-Y-Z-U that goes counterclockwise around the pentagon on Fig. [2] We will show 
that the image path will go clockwise (with logarithmic precision) around the hexagon U'-V'-W'-X'- 
Y'-Z'-U' on Fig. [2] This path makes a turn around the point (a, /3). Hence, point (a, j8) (more precisely, 
some point in 0(l)-neighborhood of (a, j3)) has a pre-image (l,m) in the rectangle [0, C(x)] x [0, C(y)]. 

Let us look closely at the path around the pentagon and its image. 
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• m = and 1 = 0... C{a\b): the image goes along U'—V' with 0(log7i)-precision; 

• m = and / = C(a\b) . . . C(a): the image goes along V'-W' with 0(log?i)-precision; 

• I = C(a) and m = 0. . . C(ft|a): the image goes along W'-X' with 0(logn)-precision; 

• / = C(a) — X andm = C(b\d) +A for A = 0. . .I(a : b): the image remains in 0(log ^-neighborhood 
ofX'; 

• m = C(b) and / = C(a\b) . . .0: the image goes along X'-Y' with C?(log«)-precision; 

• 1 = and m = C(b) . . . C{b\a): the image goes along Y'-Z' with 0(log«)-precision; 

• / = and m = C{b\a) ... 0: the image goes along Z'-U' with 0(logn) -precision; 

Thus, the path on in the image follows the trajectory U'-V'-W'-X'-Y'-Z'-U' with O (log) -precision. 
Therefore turns around the point (ot, jS) if this point is 0(log?i)-far from the boundary of the hexagon, 
and this is exactly what our assumption guarantees. □ 

Remark. Instead of the path U-W-Xi-X 2 -Y-U we could take another path U-W-X 3 -Y-U. The 
shortcut X1-X2 is equivalent to the longer path X1-X3-X2 since all points of the triangle X1X2X3 are 
mapped into O (log n) -neighborhood of X' (if / + ra > C(a,b), then y(l,m) contains enough information 
to reconstruct both a and b). 

From Theorem|3]it follows in particular that if C{a\b) and C{b\a) are not logarithmically negligible, 
one can cut by factor 2 the complexities of a and b while adding an oracle. 
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